Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR
Abstract
A method for nonstationary noise robust automatic speech recognition (ASR) is to first estimate the changing noise statistics and second clean up the features prior to recognition accordingly. Here, the first is accomplished by noise tracking in the spectral domain, while the second relies on Bayesian enhancement in the feature domain. In this way we take advantage of our recently proposed maximum a-posteriori based (MAP-B) noise power spectral density estimation algorithm, which is able to estimate the noise statistics even in time-frequency bins dominated by speech. We show that MAP-B noise tracking leads to an improved noise model estimate in the feature domain compared to estimating noise in speech absence periods only, if the bias resulting from the nonlinear transformation from the spectral to the feature domain is accounted for. Consequently, ASR results are improved, as is shown by experiments conducted on the Aurora IV database.
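The bias mentioned in the abstract can be illustrated generically. The abstract does not say how the authors compute or compensate it, so the following is only a minimal sketch under stated assumptions, not the MAP-B algorithm or the paper's correction. It assumes a logarithmic feature mapping and that the noise periodogram in a single frequency bin is exponentially distributed with mean equal to the noise power spectral density (the usual model for Gaussian noise). Under those assumptions, taking the logarithm of a spectral-domain mean overestimates the log-domain mean by the Euler-Mascheroni constant. The function name and parameters are hypothetical.

```python
import math
import random

# Euler-Mascheroni constant: for X ~ Exp(mean = lam), E[ln X] = ln(lam) - gamma.
EULER_GAMMA = 0.5772156649015329


def log_domain_bias_demo(noise_psd=2.0, num_frames=200_000, seed=0):
    """Compare log(mean power) with mean(log power) for one frequency bin.

    Hypothetical helper for illustration only; it is not taken from the paper.
    """
    rng = random.Random(seed)
    # Assumption: periodogram values of Gaussian noise in one bin are
    # exponentially distributed with mean equal to the noise PSD.
    # random.expovariate takes the rate, i.e. 1 / mean.
    periodogram = [rng.expovariate(1.0 / noise_psd) for _ in range(num_frames)]

    # Spectral-domain estimate mapped to the log domain afterwards.
    log_of_mean = math.log(sum(periodogram) / num_frames)
    # Mean of the log-domain values themselves (tiny floor avoids log(0)).
    mean_of_log = sum(math.log(max(p, 1e-300)) for p in periodogram) / num_frames
    return log_of_mean, mean_of_log


log_of_mean, mean_of_log = log_domain_bias_demo()
bias = log_of_mean - mean_of_log
print(f"log of spectral mean : {log_of_mean:.3f}")
print(f"mean of log values   : {mean_of_log:.3f}")
print(f"empirical bias       : {bias:.3f} (theory under the assumption: {EULER_GAMMA:.3f})")
```

In this toy setting the gap is a constant offset, so subtracting it from the mapped estimate removes the bias. Real ASR front ends add mel filtering and further transforms, for which the offset generally differs.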
Related resources
Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging
Noise spectrum estimation is a fundamental component of speech enhancement and speech recognition systems. In this paper, we present an improved minima controlled recursive averaging (IMCRA) approach, for noise estimation in adverse environments involving nonstationary noise, weak speech components, and low input signal-to-noise ratio (SNR). The noise estimate is obtained by averaging past spec...
Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR
This paper proposes a new multi-channel noise reduction approach that can appropriately handle highly nonstationary noise based on the spectral and locational features of speech and noise. We focus on a distant talking scenario, where a 2-ch microphone array receives a target speaker’s voice from the front while it receives highly nonstationary ambient noise from any direction. To cope well wit...
Design of robust carrier tracking systems in high dynamic and high noise conditions, with emphasis on neuro-fuzzy controller
Robust carrier tracking is defined as the ability of a receiver to determine the phase and frequency of the input carrier signal in unusual conditions such as signal loss, input signal fading, high receiver dynamics, or other destructive effects of propagation. An implementation of tight tracking can be understood in terms of adopting a very narrow loop bandwidth that contradicts the req...
Nanyang Technological University Model-Based Noise Robust Speech Recognition
Noise robustness is a challenging problem when an automatic speech recognition (ASR) system is deployed in real-life applications. This report examines techniques to improve the robustness of ASR systems. In particular, we focus on a group of model-based noise robust techniques, called vector Taylor series (VTS) methods, that adapt the acoustic model of ASR systems towards noisy test data using the ...
Speech enhancement based on smoothing of spectral noise floor
This paper presents robust speech enhancement using noise estimation based on smoothing of spectral noise floor (SNF) for nonstationary noise environments. The spectral gain function is obtained by the well-known log-spectral amplitude (LSA) estimation criterion associated with the speech presence uncertainty. The noise estimate is given by averaging actual spectral power values, using a smoothing ...